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Abstract 


Video in he digital form has huge date rate. Hence handling of digital video either 
for storage or for transmission, has become a challenge. The -way out Is to efficiently 
compress them. 

In its earliest form, digital video compression was applied to all the image data. The 
resulting images had poor definition and jerky movements accompanied by blurring. 
This might be acceptable for static images, but it is scarcely acceptable for television, 
especially for fast moving action such as sports. 

So the researchers were forced to invent a means of selective compression. Since the 
consecutive video frames are very similar In nature, a lot of compression is possible. 
In this, the parts of the overall image which did not change from frame to frame 
(such as the background, be it sky or wallpaper) could be subjected to quite severe 
compression, while fast moving picture elements were compressed to a much lesser 
extent. With the processing power available today, it has become a reality to perform 
selective compression or coding in real time. 

This project iB one such selective compression technique called els "Software based 
moving picture coder" (SBMPC). The heart of the system is the modified block trun- 
cation coder, though multiresolution sampling technique is used. Interframe coding 
technique Is used to exploit the temporal correlation between frames. DPCM tech- 
nique iB used to detect the motion between the frames. The encoder and the decoder 
is implemented on PC/AT 386 
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Chapter 1 
Introduction 


While developing an image processing system for practical applications, the first im- 
portant step is to identify clearly the overall objective. In applications for which the 
images are processed For human viewers like in television, the properties of human 
visual perception can be taken into account 

The mam objective of image coding is to represent an image with as few bits as 
possible at the same time preserving the quality and intelligibility required for the given 
applications. The levels of quality and intelligibility to be preserved, depends on the 
type of application. For example in application such as storage of image data from 
space programs, we may want to preserve all the information, while in application such 
as digital television, though high quality is important, some information in the original 
data may be destroyed, as long as the displayed image on the T.V. screen is acceptable 
to the human viewer. 

Image coding has basically two applications. The first one is reduction of channel 
bandwidth for transmission eg. Digital Television. The other application is reduction 
of storage requirements eg. Digital VCR. For transmission, compression techniques 
are more complicated because of realtime requirements, but for storage applications 
the compression technique can be less stringent because much of the preprocessing can 
be done off line. 

Image data compression methods can be classified into two basically different cat- 
egories. In the first category are those methods which exploit Redundancy in the 
data. Redundancy is a charactaristics which is related to predictability, randomness, 
smoothness, etc., in the data. For example an image of constant grey levels is fully 
predictable once the grey level of the first pixel is known On the other hand, a white 
noise random field is totally unpredictable and every pixel has to be stored to repro- 
duce the image. Thus many data compression algorithms attempt to represent a given 
sampled image array x(ij), by another array u(ij), which has no redundancy and such 
that x(ij) can be determined uniquely from u(ij). The raw data rate of u(ij) then 
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determines the data rate x(ij). Often such a process results in some compression but 
■with an accompanying distortion in the reproduced array x(y). Efficient compression 
techmques tend to minimize this distortion. 

There are various approaches to video coding. A broad classification is as follows. 

• Predictive coding. 

• Transform coding. 

• Interpolative coding. 

In predictive coding , also known as DPCM, the sample to be encoded is predicted 
from the encoded values of the previously transmitted samples and only the prediction 
error ib quantized For transmission. Such an approach can be made adaptive either 
by changing the prediction or quantization or by not transmitting the prediction error 
whenever it is below a certain threshold It is also possible to delay the encoding of a 
sample by observing the trend of the signal as indicated by certain subsequent samples. 

In transform coding^ an alternative representation of the signal is made first, by 
taking linear combinations of samples in a block of data called as coefficients and 
then quantizing the selected coefficients for transmission The primary purpose of 
transformation is to convert statistically dependant pixels into independant coefficients. 
Several transformation have been used, but the most popular one is discrete cosine 
transform (DCT). For natural pictures, DCT leads to an energy concentration in 
only a few coefficients containing the main part of the information. The coding takes 
advantage of the statistical dependencies of pixels for redundancy reduction. 

The next class of coding techmques, the interpolative coding , attempts to send 
certain samples to the reciever and interpolate all the rest. Again adaptation can be 
built in by varying the criterion for the selection of the samples to be sent and the 
strategy for interpolating the remaining samples. 

The coding can also be classified into intraframe and interframe coding In in- 
traframe coding only the first frame is coded, thus only the spatial correlation is ex- 
ploited Interframe coding is used for sequences of similar frames, including those con- 
taining moving objects. Intraframe coding removes only the spatial redundancy within 
a picture, where as interframe coding removes the temporal redundancy between the 
pictures 

Some of the standard video coding techniques such as H.261 and MPEG have been 
designed to compress video data by both reducing the spatial correlation through the 
transform coding (DCT), and the temporal correlation through the predictive coding 
(DPCM). One of the major problems of these techniques are the high computational 
complexity which inhibits these techniques for real time applications The SBMPC is a 
software based video coder and it is relatively a fast algorithm compared to transform 
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coding. The heart of the system is the modified block truncation coder. The SBMPC 
is implemented using this technique with 64 x 64 frame size. 
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Chapter 2 

Preliminary reviews 


In practice transmission channels are frequently prone to errors and when the signal 
is represented more efficiently the effect of an error becomes far more serious. Con- 
sequently, it is often necessary to add a controlled form of redundancy back into the 
signal m the form of channel encoding in order to reduce the impact of trnsmission 
errors. 


input 


Source 


Channel 


finder 


.Coder 



output 


The typical configuration then, is shown in Fig. 2 with the coding broken down 
into source encoding in which redundancy is removed from the signal for the purpose 
of achieving a more efficient representation, and channel encoding where redundancy 
is reinserted into the signal in order to obtain better channel-error performance. Off- 
course, the increase in bit rate resulting from the channel coding stage should be signif- 
icantly less than the decrease in bit rate resulting from the source encoding operation 
m order to realise a saving. 

In practice the application of picture coding to transmission channels is an economic 
tradeoff in system design, picture quality, bit rate and error performance. 

Table 1 shows the classification of the approaches that have been used for picture 
coding 
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TAHLE l: CLASSIFICATION OF PICTURE CODING 


2.1 Pulse Code Modulation 


Pulse code modulation is nothing more than a. time discrete, amplitude discrete repre- 
sentation of the signal. It is used as a digitizing scheme for purposes of transmission and 
also for digitizing before application of other, more sophisticated coding techniques. 



Figure 2.1: Pulse Code Modulation 

The basic PCM consist of sampling the waveform (usually at the nyquist rate) and 
quantizing each Bample using N levels. Each level is represented by a binary word 
containing B bits N is usually taken to be a power of two. In the decoder, these 
binary words are converted to discrete amplitude levels, and then the time sequence of 
the amplitude levels are low pass filtered. Basic PCM affords a simplicity uncommon 
to most other coders, but suffers from inefficiency since it does not use redundancy 
present m the picture signal. PCM coding systems, in general, require about 128 to 
256 levels (7 to 8 bits) for good quality pictures under most viewmg conditions. 


2.2 Predictive coding 

In predictive coding, a prediction of the sample to be encoded is made from previously 
coded information that has been transmitted. The error resulting from the subtraction 
of the prediction from the actual value of the sample is quantized into a set of discrete 
amplitude levels. These levels are represented as binary words of either fixed or variable 
word length and sent to the channel coder for transmission. Thus the predictive coder 
has two basic components: 

• Predictor. 

• quantizer 

Depending upon the number of levels of the quantizer, a distinction is often made 
between Delta modulation (DM) (N=2) and DPCM, which has N greater than 2. 
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Although. DM has been extensively used in encoding other waveforms (eg. Speech), it 
is not found great use in encoding of pictures, due perhaps to the high sampling rates 
required. 

In its simplest form, DPCM uses the coded value of the horizontally previous pixel 
as the prediction. However, more sophisticated predictors use the previous line (2-D 
predictor) as well as previous frame of information (Interframe predictor). It is the 
design of the predictors which optimizes the efficiency of the coder. 

DPCM schemes achieve compression to a large extent, by not quantizing the pre- 
diction error as finely as the original signal itself. Hence the compression in the DPCM 
entirely depends upon quantization. Several methods of optimizing quantizers have 
been studied, but quantizer design still remains an art. Most of the work on systematic 
procedures for quantizer optimization has been for the previous element DPCM coding, 
in which approximate horizontal slope of the input signal is q uant ized. 

Distortions of DPCM image coding are granular noise (grainy structure in uniform 
regions), contouring patterns, slope overload near edges (visible losses m edge) and edge 
buBynesB Reducing granular noise requires fine quantizer levels, where as attenuation 
of slope overload can be accomplished by large quantization steps. These conflicting 
requirements compel the quantizer to be designed based on art rather than on the basis 
of a mathematical model. 

Quantizers matched to the human visual sensitivity have been designed. These 
quantizers mask the DPCM distortions and generate visually acceptable images. Edgy 
busyness is defined as a noisy edge when the prediction scheme is unable to reconstruct 
the precise continuity at an edge. In interframe coding this defect appears as a blurred 
moving edge. When the number of quantization levels are too small, the images have 
layered effects, generating contouring patterns. In predictive coding a noisy channel can 
cause error propagation, hence DPCM technique for coding is suicidal against channel 
errors. 


IN. 


1 - Quuantizer 


Predictor 
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Figure 2 2. Predictive Coding 





2-3 Interpolative coding 

In. interpolative coding picture elements are subsampled and only "the subsampled el- 
ements are transmitted and the left out elements are interpolated at the reciever. In 
general interpolative coding can be classified as fixed interpolation and adaptive inter- 
polation 

In fixed interpolative methods a fixed set of picture elements are selected for trans- 
mission and the rest are interpolated. Various types of samples can be chosen for 
transmission As examples one may transmit every alternate sample, one sample out 
of four, every alternate scan line, every alternate frame The picture quality that results 
is the function of the number and type of samples that are dropped and the method 
of interpolation. Most interpolation methods have used weighted averages either using 
straight lines or higher degree polynomials. It appears that interpolation using straight 
lines is quiete effective and not much is gained by interpolation using polynomials of 
higher degree. Another method of interpolation, drops alternate fields and attempts 
to construct them by making movement of edges temporarily continuous, i.e., placing 
the edges in the dropped field at places dictated by their uniform motion between the 
adjacent transmitted field. Such technique may be useful, but in practice it has been 
found to be rather difficult to implement since it requires defination of edges and their 
motion. 

Adaptive interpolative coding consists of choosing certain points for transmission, 
constructing the interpolation of non transmitted pixels and evaluating the interpolar 
tion error. If the error is below a certain threshold, then fewer points are chosen for 
transmission. The fewest number of points that are needed to keep the Interpolation 
error below the threshold is normally the desired charactaristics. 

Performance of interpolative coders depends upon the technique used for transmit- 
ting samples and for interpolating. In terms of number of bits required for a given 
quality, interpolative coders show an improvement over nonadaptive DPCM, but they 
appear to be inferior to both adaptive DPCM and transform techniques 


2.4 Transform coding 

In trnsform coding, a picture is divided into subpictures and each BubpLCtures are 
transformed into a set of more independant coefficients. The coefficients are then 
quantized and coded for transmission. At the reciever the recieved bits are decoded 
into transform coefficients. An inverse transformation is applied to recover intensities of 
picture elements Much of the compression is a result of dropping coefficients from the 
transmission that are small and coarsely quantizing the others as required by the picture 
quality The important parameter that determine the performance of the transform 
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coder axe size and shape of subpictures, "type of "transformation used, selection of the 
coefficients to be transmitted and quantization of them, and the bit assignor which 
assigns a binary word for each of the quantizer outputs 

The transformation widely used for image coding is the DCT. DCT has the energy 
packing capabilities. The development of various fast algorithms for efficient imple- 
mentation of DCT further contributed to its popularity. 

The two important properties of DCT are orthogonality and seperability. Orthog- 
onality implies that the energy or information of a signal is preserved under transfor- 
mation (mapping into the DCT domain). Seperability implies that a multidimensional 
DCT can be implemented by a series of one- dimensional transforms The benifit of 
this property is that the fast algorithms developed for the one-dimensional DCT can 
be directly extended to multidimensional transforms. 

As an exercise I implemented DCT for video compression using following DCT 
transforms. 

tfi-iAfe-i n v 

C x [k lt k 2 ] = E E + \)cos—k 2 {2n 2 + 1) 

nt =0 f*2=0 2A U 

0 < ki < Nx - 1 

0 < k 2 < N 2 - 1 

1 fh-iih-t r 

*[«i,«z] = E E W 1 [k l ]W 2 [k 2 )C x [k lM ]cos—-k 1 (2n l + l)cos—k z (2n z + 1) 
kl=0 h=0 ^2 


WM = { J 


, &i = 0 

, 1 < h < Ni - 1 


W 2 [k 2 ] = { 


, k 2 — 0 

, 1 < kz < N 2 - 1 


A 32 x 32 image was taken and using the above equations and diagram the image was 
compressed. The main disadvantage was high computation time But the compressed 
image was of good quality. 
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The block diagram shown above was the set up used to implement DCT for video 
compression The discrete cosine transform of the image was first taken using the equar 
tions mentioned above. The coefficients of the DCT was observed. All the coefficients 
which were small were equated zero. A threshold was set, and all the values less than 
that threshold were equated to zero. Different threshold levels were tried. 

After making the small coefficients zero the inverse discrete cosine transform was 
applied and the image was displayed By varying the threshold i.e.,by changing the 
compression ratio the image quality was observed 
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Chapter 3 

Software Based Motion Picture 
Coder 


The heart of "the software based motion picture coder is the modified block truncation 
coder and the multiresolution subsampling technique. In the next section, the block 
truncation coder is described and the reason to modify the block truncation coder is 
explored, so that it can meet the requirements of software based motion picture coder 
In subsequent sections the modified block truncation coder, the SBMPC coder and 
decoder and the way it was implemented will be described. 


3.1 Block truncation coding algorithm 

In block truncation coding (BTC) the image is first divided into n x n blocks. The 
blocks are coded individually, each into a two level signal. The levels for each block are 
chosen Buchthat the first two sample moments are presevered. 

Let m = n 2 and let aci, x 2 , > x m he the values of the pixels in a block of the original 
picture. Then the first and the second sample moments and the sample variance are, 
respectively. X\ - ak, and X 2 = and o 2 = X 2 - X\. As with the 

design of any, one bit quantizer we find a threshold, Xik and two output levels a and 
b such that if X x > Xa, output = b and if X x < X^ output = a for % = 1, 2, . . . , m. 
For the quantizer, we set Xtj, = Xt. The output levels a and b are found by solving the 
following equations. Let q = number of a^’s greater than Xa,- 

Then in order to preserve XiandX 2) mXi = ( m-q)a+qb and mX 2 = (m- q)a 2 +qti t 
Solving for a and b 


a = 




n 



b = Xx + a 

Each block is then described by "the values of X\ t c, and an nxn bit plane consisting of 
l’s and 0’s indicating whether pixels are above or below X t t,. The reciever reconstructs 
the image block by calculating a and b and then assigning these values to pixels in 
accordance with the code in the bit plane. 

Illustration of block truncation coding for a 4 x 4 picture block 

• The image is first divided into small non overlapping blocks of 4 x 4. 

• The first and second sample moments are computed. 

• A bit plane is constructed such that each pixel location is coded as a "one" or a 
"zero" depending on whether that pixel is greater than X\ 

• The bit plane, X\ and cr are sent to the reciever. 

• The picture block is reconstructed such that X\ and cr are preserved That is, 
pixels m the bit plane that are "0" are set to "a" and the "l^s are set to "b" 

For example, suppose a 4 x 4 picture block is given by thje following 

' 121 114 56 47 ‘ 

_ 37 200 247 255 

Axj 16 0 12 169 

. 43 5 7 251 . 

Hence, X 1 = 98.75, a = 92.95, q = 7, and a = 16.7 ~ 17, b = 204 2 is 204. The bit 
plane is 

'1 1 00' 

0 111 
0 0 0 1 - 
. 0 0 0 1 . 

The reconstructed block becomes 

' 204 204 17 17 ' 

17 204 204 204 

17 17 17 204 

17 17 17 204 . 

and the Sample mean and Variance are preserved, 
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Because the calculations are relatively simple and storage of data small, BTC is 
fiurly easy to Implement. BTC was Implemented using 32 x 32 image. The decoded 
image quality was good. 

The advantages of BTC are listed below, and these are the reasons BTC was chosen 
as the basic kernel of SBMPC. 

• The coding speed of BTC is very fast. 

• The BTC is an edge preserving algorithm. 

• BTC retains important visual features, while discarding details which are not 
likely to be noticable by a human observer. 

Although the BTC has the benefits described above, there are some inborn defects 
existed in BTC and these are the reasons that why BTC cannot be used directly in 
SBMPC. 

• The compression ratio of BTC is not high enough to meet the bit rate requirement 
of a video coding system. 

• Although the coding speed of BTC is fast, it is still not fast enough to meet the 
real time requirement of SBMPC system. 

The modified BTC which will be described in the next section is designed to over- 
come the limitations of BTC mentioned above. 


3.2 Modified BTC 

The modification incorporated in modified BTC is mainly subsampling Like in. Block 
truncation coding, the input image is first segmented into blocks Block sizes of 4 x 4, 
8 x 8 or 16 x 16 are generally utilized in image coding. 

These small block sizes enableB us to introduce adaptive features based on activity 
or detail in the block. Although there may be some correlation between neighbouring 
blocks, this is too insignificant to warrant large block size beyond 16 x 16. 

In modified BTC bLock size of 8 x 8 is used. After the image is segmented into 
blocks, the pixels within the blocks are subsampied. The sub sampling process in- 
creases compression ratio as well as increases the computation speed. Depending upon 
the requirement of computation speed, compression ratio and quality of picture, the 
subsampling rate can be chosen. Also the subsampling rate can be made adaptive 
according to the changes in the picture frame. Furthermore, a simplified algorithm is 
used for computing the quantizers threshold and the two reconstruction levels for each 
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Figure 3.1: Modified BTC 


subsampled blocks compared “to the algorithm used in the block truncation coding. 
Finally the subeampled pixels are quantized using two level quantizer and then a bit 
plane is formed which along with the reconstruction levels are transmitted. 

The decoder of the modified BTC recieves the bitplane along with reconstruction 
levels. Then it reconstructs the quantized pixels and interpolates the pixels eliminated 
by the Bubsample process, synthesizes and generates an output image. 

The block diagram of Encoder of modified BTC is shown in Figure 3.1. 

The encoder of modified block truncation coder consists of blocker, subsampler, 
derimator, quantizer parameter decider and the two level quantizer. The blocker divides 
the image into several 8x8. blocks. If the original image data is OX(n), the blocker 
divides the original image data OX(n) into blocks X(n) with size 8x8 These blocks 
are then sent into the subsampler. Let X(n) and SX (n) denote the input and output 
data of subsampler, respectively. Then the function performed by subsampler is as 
follows. 

3X{n) = X{k x n) 

where, k is a positive integer. 

The array below shows the function performed by subsampler. 
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X is the input to the subsampler and SX is the output of the subsampler with k 
set to 3 The framed pixels are selected by the subsampler. 

The subsampler output is fed into the quantizer parameter decider which generates 
the threshold QT and the two reconstruction levels. In the BTC the original blocks 
are used- to generate QT and the two reconstruction levels, but in modified BTC the 
subsampled blocks are used to generate QT and the two reconstruction levels. This 
modification is done so as to increase the computation speed. 

The parameters QT and the two reconstruction levels RL(0) and RL(l) are defined 
by the following equations 


1 * 1 0 S »<[*1 

which means it is the average value of the subsampled pixels. 

1 


RL(0) 


r . $sx(i)<qtSX (») , 


£f*i Ssx « & o*<ryi 

RL(l) = — — j 53 8sx(x)>qtS X (t) 

L^)Ssx>qt 

where k is the value chosen at the subsampler and 

*/ = 1 


when / is true, otherwise 


5 / =0 


In otherwords RL(0) is the average value of subsampled pixels which are less than QT, 
and RL(1) is the average of subsampled pixels which are greater than or equal to QT 
The function of dedmator is similar to subsample Function of subsampler. The two 
level quantizer quantizes the decimated image DX(n), using the quantizer threshold 
QT. After quantization a bit plane is formed. The bit plane B(n) output by the two 
level quantizer is defined by 


B(n) = S DX {n)>qr 

where 5/ = 1 when / ib true, otherwise 5j = 0 

The bit plane is then multiplexed with the reconstruction levels RL(0) and RL(1) 
and sent to the modified BTC decoder. 

At the decoder the bitplane is converted back to decimated pixels using the two 
reconstruction levels, i.e., the inverse quantizer performs an inverse function of two 
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level quantizer. Because the image pixels reconstructed by the inverse quantizer are 
decimated pixels, the pixels eliminated by the decimator must be recovered by the 
interpolator. Straight line interpolation was used For implementation. Finally the 
recovered image blocks are synthesized as a single frame to form an output image 


3.3 Implementation of modified BTC 

The modified BTC was implemented using 32 x 32 eye image, on a PC/AT 386 machine 
using C language programming 

First the image was divided into 8x8 non overlapping blocks. To illustrate the 
steps involved in modified BTC the first block of the image is reproduced below 

‘ 118 113 85 101 161 151 151 163 ‘ 

118 85 101 151 140 144 157 162 

89 90 141 146 140 138 145 166 

84 141 158 134 138 138 143 155 

129 161 140 144 147 137 140 156 * 

171 145 152 160 153 141 138 157 

154 147 162 170 151 133 129 155 

. 150 161 174 180 143 133 138 168 _ 

Next step is to subsample each block Subsampling was earned out using k = 3. 
Reproduced below is the first block of the image eye after subsampling 

[118, 101, 151,85, 140, 162, 141,138,84, 134, 143, 

161, 147, 156, 152, 141, 154, 170, 129, 161, 143, 168] 

Threshold QT = 146.61 c* 147 

The two reconstruction levels are 

RL{ 0) = 124.75 ~ 125, 

RL( 1) = 158 19 - 158, 


and the bit plane 

[ 0 , 0 , 1 , 0 , 0 , 1 , 0 , 0 , 0 , 0 , 0 , 1 , 1 , 1 , 1 , 0 , 1 , 1 , 0 , 1 , 0 , 11 . 

The bit plane alongwith the two reconstruction levels were transmitted. 

At the reciever, the recieved bit plane was reconstructed using the two reconstruc- 
tion levels as shown below.. 

[125, 125, 158, 125, 125, 158, 125, 125, 125, 125, 125, 

158, 158, 158, 158, 125, 158, 158, 125, 158, 125, 158] 
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Next step is to interpolate the left out values during decimation, A simple straight 
line interpolation was used to interpolate the left out values. 

Shown below is the first block after interpolation. 


' 125 

125 

125 

125 

136 

147 

158 

147 ‘ 

136 

125 

125 

125 

125 

136 

147 

158 

147 

136 

125 

125 

125 

125 

125 

125 

125 

125 

125 

125 

125 

125 

125 

136 
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158 

158 

158 

158 

158 

158 

158 

158 

158 

158 

147 

136 

125 

136 

147 

158 

158 

158 

158 

147 

136 

125 

136 

. 147 

158 

147 

136 

125 

136 

147 

158 


Finally all the blocks were reorganized into a single frame and the reconstructed image 
was displayed. 

3.4 SBMPC SYSTEM 

The block diagram of SBMPC encoder is in Figure below. The input image is first 
partitioned into image blocks by the blocker. Then these image blocks are sent to the 
PCM and the motion detector. When the image sequence contains a scene change 
or some objects in the image sequence move quickly, the image would contain lot of 
information and must be detected. The funcuon of the motion detecter is to detect 
for any movement in the present frame compared to the previous frame. To detect the 
movement DP CM technique is used. 

The motion detector compares a block in the present frame with the corresponding 
block in the previous frame and detects whether there was any motion The multires- 
olution sampler gets the image blocks sent by the PCM along with the information 
regarding the movement. Then the blocks are divided into several activity classes 

Mult resolution 



Figure 3.2: Block diagram of SBMPC encoder 
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Figure 3 3' Block diagram of SBMPC decoder 

The activity of an image block is defined by 

ACT = RL{ 0) - RL( 1) 

where, RL(0) and RL( 1) are defined by the equations for reconstruction levels in the 
Modified BTC. 

Depending upon the activity, the multireBolution sampler changes the subsampling 
rate in the modified BTC. The modified BTC sub samples each block with different 
subsampling rate depending upon the activity. When the objects m the video are 
moving, it is more important to trace the actions of the objects fast and clearly, than 
to describe the contents of these objects in detail. Because the image blocks which 
contain a part of moving objects usually have higher activity values than the blocks 
which contains a part of the backround , the multiresolution sampler should ensure that 
the modified BTC codes the higher activity blocks with higher sampling rate and lower 
activity blocks with lower sampling rate. 

Hence the modified BTC codes different blocks with different subsampling rate 
and sends the output data which consists of bitplane, reconstruction levels and the 
information regarding the subsampling rate for each block to the SBMPC decoder 

The block diagram of SBMPC decoder is as shown in Fig. 3 3 

The decoder recieves the bitplane, two reconstruction levels and the information 
regarding the subsampling rate for each block. The bitplanes of each block is recon- 
structed using two reconstruction levels, i.e., zeroe’s in the bitplane are replaced by 
-Rl/(0)’s and one’s in the bitplane are replaced by RL(l)'s. 

After reconstructing the bitplane, the decoder checks for the information, at what 
rate the block was subsampled at the encoder. Depending upon the subsampling rate 
used at the encoder, the blocks are interpolated suitably 
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3.5 Implementation 

The SBMPC was implemented on PC/AT 386 using ’C’ language programming. Since 
the SBMPC system is for moving pictures, several frames of continuous sequence was 
necessary. Fifteen frames of a sequence of a person moving his head from left to right 
was used These frames were initially stored in the computer as Framel, Frame2, . 
Framel5. Frame size of 64 x 64 was used. 

The incoming picture frames are divided into several nonoverlapping blocks. Block 
size of 8 x 8 is used. Since the frame size is 64 x 64 each frame will be divided into 64 
blocks of block size 8x8 After dividing the picture frame into blocks, the program 
checks whether a particular block belongs to Framel or not. If the block is From framel, 
modified BTC with k = 1 is applied to that block, and the corresponding bit plane and 
the two reconstruction levels are transmitted. All the blocks of framel are similarly 
coded and transmitted. 

However, if the program finds that a particular block is not from framel, it has 
to first of all detect whether there was any movement in that block compared to the 
corresponding block in the previous frame. If there is a moevment or change, the 
program has to also find the amount of change detected. For this purpose, the blocks 
are divided into three activity classes. They are Activity class x, Activity class y ) 
Activity class z 

The flow charts for the implementation of SBMPC encoder is shown m the following 
page. 

To detect the movement in a particular block, the difference between that block and 
the corresponding block in the previous frame is calculated. Then the activity of that 
block is given by ACT = RL(0) - RL( 1). The blocks were divided into three activity 
classes as mentioned earlier. Activity x, very 1 Little or no change 

ACT x < 10 

Activity y. moderate change 

10 < ACT y < 25 

Activity z =, large change 

ACT z > 25. 

For all the blocks with activity x, modified BTC with k = 5 is applied and for all 
the blocks with activity y, modified BTC with k = 3 is applied and finally for all the 
blocks with activity z, modified BTC with k = 1 is applied. 

The modified block truncation coder generates corresponding bit plane and recon- 
struction levels for each block which are transmitted 

The decoder recieves the bitplane, two reconstruction levels and the information 
regarding the subsampling rate The first thing that the decoder does is to reconstruct 
the bitplane by replacing Os in the bitplane by 72L(0)s and Is in the bitplane by R.L(l)s, 
Once the reconstruction is completed the decoder has to interpolate the left out pixels 
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SBMPC Transmitter 



transmit 




during the decimation operation. Since the subsampling rate is different for different 
blocks, the interpolator should adapt itself according to the subsampling rate. 

To simplify the interpolation operation, what was done is, each block was checked 
for its subsampling rate which is known by the value of k. If k is found to be 3 three 
zeroes were padded inbetween all the values of reconstructed block. If k is found to be 
5, five zeroes were padded inbetween all the values of reconstructed block. However if 
k is found to be 1, no padding necessary. ' 

Finally using a simple straight line interpolation the left out values are interpolated 
and is illustrated in the next page. 




x(0) x(l) x(2) x(3) 


3/(0) and 3/(3) are known. 3/(1) and y(2) are unknown and are -to be interpolated. 


2 /( 3 ) 

2 /( 0 ) 
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All the fifteen frameB were encoded by the SBMPC and the decoded image wag 
displayed, and the compression ratio was calculated The quality of the decoded image 
were satisfactory, but can still be improved by using better interpolation technique and 
may be multilevel quantization. 
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Chapter 4 
Conclusions 


Software based motion picture coder is computationally high speed algorithm with high 
compression ratio. The main advantage of SBMPC is that it is easy to implement. 
The heart of the system is the modified BTC which employs subsampling process 
which can both accelerate the computations and achieve higher compression ratio. 
Furthermore, a simplified algorithm is used for computing the quantizer threshold and 
the two reconstruction levels for each subsampled block 

The quality of decoded image is not of high quality, mainly because in modified BTC 
two level quantizer are used at the coder and a simple straight line interpolator Is used 
at the decoder. To improve the quality of the decoded image multilevel quantization 
can be used at the coder and at the decoder still better interpolation technique can be 
employed Median filter can be used as a interpolater. The mam advantage is it gives 
good results around sharp edges. 

The compression ratio is given by 

Compressionratio = ~ 

For SBMPC compression ratio for all the fifteen frames were calculated and it was 
found that on an average the compression ratio was found to be 25. However, the 
compression ratio can be changed depending upon the threshold set For the activity 
classes. A trade off between compression ratio and quality of the decoded image can 
be arrived at. 

When the computation power of PC is inceased, the performance of SBMPC can 
be improved in several ways. First, some simple and fast motion detection algorithm 
can be used to remove the temporal redundancy further. Secondly, some fast transform 
algorithm ran be used to remove the spatial redundancy. Third, entropy coding can 
be introduced into SBMPC system to encode data more compactly. Finally, the frame 
rate can be increased and the frame size can be enlarged 
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